Why 1-Bit Sigma-Delta Conversion is Unsuitable for High-Quality Applications
نویسندگان
چکیده
Single-stage, 1-bit sigma-delta converters are in principle imperfectible. We prove this fact. The reason, simply stated, is that, when properly dithered, they are in constant overload. Prevention of overload allows only partial dithering to be performed. The consequence is that distortion, limit cycles, instability, and noise modulation can never be totally avoided. We demonstrate these effects, and using coherent averaging techniques, are able to display the consequent profusion of nonlinear artefacts which are usually hidden in the noise floor. Recording, editing, storage, or conversion systems using single-stage, 1-bit sigma-delta modulators, are thus inimical to audio of the highest quality. In contrast, multi-bit sigma-delta converters, which output linear PCM code, are in principle infinitely perfectible. (Here, multi-bit refers to at least two bits in the converter.) They can be properly dithered so as to guarantee the absence of all distortion, limit cycles, and noise modulation. The audio industry is misguided if it adopts 1-bit sigma-delta conversion as the basis for any high-quality processing, archiving, or distribution format to replace multi-bit, linear PCM. 0. INTRODUCTION This paper is an enlarged and extended version of [1], and its findings regarding 1-bit sigma-delta modulators are explored in greater detail in an associated paper [2]. In the past twenty or so years we have seen the multi-bit converter technology used in professional and consumer equipment progress from 14, through 16 and 18, to 20 or more bits of resolution. Indeed, the 16-bit linear PCM format became enshrined in the CD standard, and was the basis of most digital audio storage devices for many years. All analogue-to-digital and digital-to-analogue conversions and intermediate digital signal processing steps were performed in the linear, multi-bit PCM format, using internal processing wordlengths greater than the desired final numerical precision. One primary benefit of this format is the fact that such systems can be rendered completely linear, with infinite resolution below the least significant bit (LSB), by the adoption of proper dithering at each quantizing, or (in the case of editing and signal processing) at each requantizing, stage. Such dithering, with the optimal triangular probability density function (TPDF) dither, in principle completely LIPSHITZ AND VANDERKOOY WHY 1-BIT SIGMA-DELTA CONVERSION IS UNSUITABLE AES 110 CONVENTION, AMSTERDAM, NETHERLANDS, 2001 MAY 12–15 2 eliminates all distortion, noise modulation, and other signaldependent artefacts, leaving a storage system with a constant, signalindependent, and hence benign noise floor (see [3] and [4]). This is now well understood, and such practices have been the norm in the industry for over a decade. In practice, of course, no actual analogue realization can achieve this theoretical perfection, but in the digital domain the departure from perfection can indeed be zero due to the numerical precision of the arithmetical operations involved. In recent years, we have seen the consumer audio industry perform a remarkable feat of salesmanship by proclaiming that 1-bit converters are better than multi-bit converters, and succeeding in marketing 1bit products as preferable for the highest-quality performance. The original primary motivation for pursuing the 1-bit converter architecture was not superior performance, but rather the fact that it is cheaper to manufacture, consumes less power, and can operate well at the voltages used in battery-powered portable equipment. This has now become secondary, as 1-bit converters are currently used in consumer audio equipment at all price and quality levels. The manufacturers of high-quality converters struggled mightily to produce 1-bit devices that met the performance goals of the industry. But, they could never eliminate all the undesirable artefacts of such converters, and after more than a decade of trying, they came to the realization that they could produce better performance by using multi-bit converter architectures in their products. The one inherent advantage of the 1-bit architecture, namely its avoidance of the levelmatching difficulties found in multi-bit converters, turned out not to be as significant a benefit as one might have thought. If one examines the current data-sheets of all the major high-quality converter manufacturers, one finds that they have almost universally given up on the 1-bit sigma-delta topology in favor of oversampling converters using more than two levels. Such converter architectures can avoid the intractabilities of both the 1-bit and the 20+ -bit designs. They can be properly dithered, and can thus be guaranteed to be free of low-level, limit-cycle oscillations (“birdies”). Moreover, they do not suffer from the high-level instability problems of the higher-order, 1-bit sigma-delta converters. In light of the above, it is with alarm that we note the adoption of the single-stage, 1-bit sigma-delta converter architecture as the encoding standard for a next-generation (and supposedly higher-quality) consumer digital audio format. We refer, of course, to the Direct Stream Digital (DSD) encoding which forms the basis of the Super Audio CD format introduced recently by Philips and Sony (see, for example, [5] and [6]). The original intention to have the digital audio data at every stage of the processing — from the original analogueto-digital conversion, through all the editing and mastering operations — stored in the DSD 1-bit format has apparently now been abandoned. This was a wise decision. The conversion to the final 1-bit DSD format, however, still represents a required, and quite unnecessary, degradation of the quality of the audio signal. Every single 1-bit data conversion entails an inevitable loss of signal quality in a way which need not occur with multi-bit, linear PCM. The original rationale for storing a 1-bit DSD format signal on the Super Audio CD has now entirely vanished. The analogue-to digital and digital-to-analogue conversions, and all intermediate digital signal processing, will likely be done using multi-bit converters and storage formats. There really is no point in degrading the signal, by squeezing it onto a 1-bit Super Audio CD for transmission to the consumer, only to have it converted back to multi-bit PCM in the process of being played back. We shall now explain our reasoning in detail. 1, 2 Trademarks of Philips Electronics NV and Sony Electronics Inc. 1. MULTI-BIT VERSUS 1-BIT CONVERTERS In a normal multi-bit digital audio system, the intention is that the quantizer (i.e., essentially the number system) is never deliberately driven into saturation. Because one has enough levels available, avoiding saturation is not a significant problem in practice. Moreover, there is no problem in devoting a few LSBs of headroom to ensuring that quantization errors are properly dithered. In straight linear PCM encoding, the proper (i.e., TPDF) dither spans precisely two LSBs. For example, in a straight 16-bit system, the dither occupies only two out of the 65,536 levels available. This causes a negligible reduction in system headroom in return for all the acknowledged benefits of properly-dithered signal manipulation. If one wishes to reduce the data word-length used, one can recover the lost signal-to-noise ratio by a combination of oversampling and noise shaping. Alternatively, one can increase the system’s signal-to-noise ratio by the use of oversampling and/or noise shaping, while leaving the word-length unchanged. Noise shaping allows one to increase the signal-to-noise ratio in the audio band at the expense of decreasing it at frequencies above the audio band. One can even use in-band noise shaping without oversampling to significantly increase the perceived signal-to-noise ratio (see [7] and [8]). As long as the quantizer inside the noise shaper does not saturate, and is properly dithered, one is guaranteed that this process is completely transparent, in that it is totally distortion free. Noise shaping entails negative error feedback around the quantizer. In a noise shaper, a filter H(z) is used to spectrally shape the quantization error E. Fig. 1 shows the architecture of a simple dithered noise-shaping quantizer. Figure 1. Simple dithered noise-shaping quantizer. In this diagram, X is the input signal, N is the dither, W is the total input to the quantizer Q, and Y is the output signal. The quantization error E is extracted around the dithered quantizer (which can be multi-bit or single-bit), and subtracted from the input after passing through the noise-shaping filter H(z). H(z) can be either recursive or non-recursive. This is the error feedback loop. The signal in this loop is very small as long as the quantizer does not overload. The dither N controls the statistics of the error signal E such that, with TPDF dither, E has zero mean, constant variance, and a constant white power spectral density, independent of the input signal — indeed, E is then uncorrelated with X. This means that there is no distortion or noise modulation (see [3] and [4]). In addition, the negative feedback loop is stable as long as there is no overload, and this is easily achieved with a multi-bit quantizer Q. The theory of such dithered noise shapers can be found in [7], [8], and [9] for example. In a sampled-data realization, the z-transforms of the input, X(z), output, Y(z), and error, E(z), are related by Y(z) = X(z) + {1 – H(z)}⋅E(z). LIPSHITZ AND VANDERKOOY WHY 1-BIT SIGMA-DELTA CONVERSION IS UNSUITABLE AES 110 CONVENTION, AMSTERDAM, NETHERLANDS, 2001 MAY 12–15 3 The signal thus passes through the system unchanged, and the quantization error E(z) appears at the output shaped by the effective noise-transfer function {1 – H(z)}, to become the system’s total error {1 – H(z)}⋅E(z). Proper TPDF dither N controls the statistical properties and power spectrum of the error signal E, and hence controls the power spectrum of the shaped output error {1 – H(z)}⋅E(z). For stable operation, and the least possible noise for a given noise-shaping curve, the equivalent noise-shaping filter {1 – H(z)} must be minimum phase. Of course, for computability, H(z) must incorporate at least a single sample delay. It should be noted that, in the absence of proper dither in Fig. 1, the circuit exhibits not only the expected signal-dependent quantization distortions and noise modulations, but also low-level limit-cycle oscillations, because of the nonlinearity Q within the feedback loop. These “birdies” are input dc-offset dependent, and are frequency modulated by the audio signal. They can be quite pernicious and audible, and are an artefact of undithered noise shapers in general, but are completely eliminated by proper dithering. We shall consider quantizers Q(W) of the mid-riser type shown in Fig. 2, since this characteristic is most appropriate for the 1-bit case, which we shall shortly be considering. Figure 2. Mid-riser quantization characteristic adopted. In Fig. 2, the size of the LSB is represented by ∆, so that the quantized output levels are ±∆/2, ±3∆/2, ±5∆/2, etc. In an N-bit quantizer, there are 2 levels (i.e., LSBs). In a 1-bit quantizer, however, there are only the two indicated central levels present, namely ±∆/2. At this point one should note a couple of very important facts: 1) If the total input W to the quantizer always lies in the range –∆ ≤ W < ∆, no additional output levels beyond ±∆/2 will be called upon, and a 1-bit quantizer will behave just like a multi-bit one. Under these conditions, the full theory of dithered multi-bit quantizers can be applied to deduce the system’s behaviour. If W lies outside this range, however, the 1-bit quantizer overloads (i.e., saturates), and the multi-bit theory breaks down. 2) The noise-shaper circuit of Fig. 1 is functionally completely equivalent to the single-stage sigma-delta converter, which forms the heart of the DSD 1-bit encoder of the Super Audio CD. Simple circuit transformations allow one to convert the one configuration into the other. This is exhibited in Fig. 3, which shows in (a) the most general noise-shaper topology, and in (b) its equivalent sigma-delta form. In contradistinction to the error feedback of the noise-shaper topology (a), the equivalent sigma-delta topology (b) could be said to represent straight negative feedback. (To obtain the most-frequently used sigma-delta structure, where the filtering is done solely in the forward path, we set F ≡ 1, and the forward-path filter then becomes {G/(1 − G)}.) The advantage of looking at the circuit as a noise-shaper is that it is easier to understand than the sigma-delta circuit. Moreover, the error signal E is explicitly available in the former topology, but is implicit in the latter. We shall use the basic noise shaper circuit of Fig. 1 for the experiments to follow. Everything we have to say about the noise shaper thus applies equally to the corresponding sigmadelta converter under the transformation shown in Fig. 3. If the latter has enough bits so that it does not overload when properly dithered, it can thus in principle be perfect. That it must misbehave when it has only two levels is what we want to prove. Figure 3. Showing the general equivalence of the noise-shaper (a) and sigma-delta (b) topologies. We claim that a 1-bit sigma-delta converter must overload when properly dithered. This follows at once, since the TPDF dither N, which is needed to fully linearize the quantizer, itself swings the quantizer’s input W over its full no-overload range of ±∆. This is illustrated in Fig. 2. To obtain the total quantizer input W, we must add to N both the input signal X and the fed back error signal {−HE} (see Fig. 1). Now, the dither samples are statistically independent of the other signals in the loop, and so clearly the quantizer’s total input W has to produce overload, and its consequences — distortion, noise modulation, and instability — if there is any input or feedback. One might think that one could maintain most of the benefits while preventing the overload caused by using full TPDF dither, by using only partial dithering and/or a reduced maximum input signal level. This is true. But, reduction in the maximum signal level is undesirable because of its direct impact on the signal-to-noise ratio; LIPSHITZ AND VANDERKOOY WHY 1-BIT SIGMA-DELTA CONVERSION IS UNSUITABLE AES 110 CONVENTION, AMSTERDAM, NETHERLANDS, 2001 MAY 12–15 4 and, as we shall see, the needed reduction in dither level is so great that its main remaining benefit is the prevention of limit cycles and noise modulation, and not the reduction of distortion. In the special case of the 1-order sigma-delta modulator, one can prove that there exists a limited range of dither levels and of input signal levels which guarantees the absence of quantizer overload. Such results are more elusive in the higher-order case. These facts are demonstrated mathematically in the Appendix, to which we refer the interested reader. The important point is that full TPDF dither is never allowable in a 1-bit noise shaper or sigma-delta modulator, and hence full linearity is never achievable either in principle or, of course, in
منابع مشابه
An Overview of Sigma-Delta Converters: How a 1-bit ADC achieves more than 16-bit resolution
This article briefly describes conventional A/D conversion, as well as its performance modeling. We then look at the technique of oversampling, which can be used to improve the resolution of classical A/D methods. We discuss how sigma-delta converters use the technique of noise shaping in addition to oversampling to allow high resolution conversion of relatively low bandwidth signals. Next, we ...
متن کاملMash 2-1 Multi-bit Sigma-delta Modulator for Wlan
This paper mainly explores how oversampling and feedback can be employed in high-resolution modulators to extend the signal bandwidth into the range of megahertz, where oversampling ratio is constrained. A 2-1 cascaded multi-bit architecture suitable for broad-band applications is presented, and a linearization technique referred to as partitioned data weighted averaging (DWA) is introduced to ...
متن کاملDelta-Sigma Modulators using Frequency- Modulated Intermediate Values
This paper describes a new firstand second-order delta-sigma modulator concept where the first integrator is extracted and implemented by a frequency modulator with the modulating signal as the input. The result is a simple delta-sigma modulator with no need for digital-to-analog converters, allowing straight-forward multi-bit quantization. Without the frequency modulator, the circuit becomes a...
متن کاملAn Overview of Sigma-Delta Converters - IEEE Signal Processing Magazine
A lthough real world signals are analog, it is often desirable to convert them into the digital domain .using an analog to digital converter (ADC). Motivating designers to apply this conversion is the efficient transmission and storage of digital signals. Digital representation of an audio signal, for example, allows CD players to achieve virtually error free storage using optical disks [l]. In...
متن کاملLinearising Sigma-Delta Modulators Using Dither and Chaos
Recent work has shown that high-order single-bit sigma-delta modulators suffer from lowlevel artifacts such as idle tones and noise modulation. Techniques that have been proposed to reduce or eliminate these errors include the application of dither inside the one-bit quantiser loop, and selecting a loop filter which makes the modulator chaotic. This paper compares the efficacy of these two appr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001